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Abstract 

Given two rooted, labeled trees P and T the tree path subsequence problem is to determine which 
paths in P are subsequences of which paths in T. Here a path begins at the root and ends at a leaf. In 
this paper we propose this problem as a useful query primitive for XML data, and provide new algorithms 
improving the previously best known time and space bounds. 


1 Introduction 

We say that a tree is labeled if each node is assigned a character from an alphabet E. Given two sequences 
of labeled nodes p and t, we say that p is a subsequence of t, denoted p C t, if p can be obtained by removing 
nodes from t. Given two rooted, labeled trees P and T the tree path subsequence problem (TPS) is to 
determine which paths in P are subsequences of which paths in T. Here a path begins at the root and ends 
at a leaf. That is, for each path p in P we must report all paths t in T such that pQt. 

This problem was introduced by Chen [4] who gave an algorithm using 0(mm(lpnp + np,nplp + n t)) 
time and Oilpdp + np + up) space. Here, ns, Is, and ds denotes the number of nodes, number of leaves, 
and depth, respectively, of a tree S. Note that in the worst-case this is quadratic time and space. In this 
paper we present improved algorithms giving the following result: 

Theorem 1. For trees P and T the tree path subsequence problem can be solved in 0{np + np) space with 
the following running times: 

{ 0(lpnp + np), 

0(nplp + np), 

0(£f ^ + n T + n P log n P ). 

The first two bounds in Theorem 1 match the previous time bounds while improving the space to linear. 
This is achieved using a algorithm that resembles the algorithm of Chen [4]. At a high level, the algorithms 
are essentially identical and therefore the bounds should be regarded as an improved analysis of Chen’s 
algorithm. The latter bound is obtained by using an entirely new algorithm that improves the worst-case 
quadratic time. Specifically, whenever log np = 0(np/ log nr) the running time is improved by a logarithmic 
factor. Note that - in the worst-case - the number of pairs consisting of a path from P and a path T is 
fl(npnp), and therefore we need at least as many bits to report the solution to TPS. Hence, on a RAM 
with logarithmic word size our worst-case bound is optimal. Most importantly, all our algorithms use linear 
space. For practical applications this will likely make it possible to solve TPS on large trees and improve 
running time since more of the computation can be kept in main memory. 

*An extended abstract of this paper appeared in Proceedings of the 6th Italian Conference on Algorithms and Complexity, 
2006. 
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Figure 1: (a) The trie of queries 1,2,3, or the tree for query 4. (b) A fragment of a catalog of books. 
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1.1 Applications 

We propose TPS as a useful query primitive for XML data. The key idea is that an XML document D may 
be viewed as a rooted, labeled tree (different nodes can be assigned the same label). For example, suppose 
that we want to maintain a catalog of books for a bookstore. A fragment of a possible XML tree, denoted 
D, corresponding to the catalog is shown in Fig. 1(b). In addition to supporting full-text queries, such as 
find all documents containing the word “John”, we can also use the tree structure of the catalog to ask more 
specific queries, such as the following examples: 

1. Find all books written by John, 

2. find all books written by Paul, 

3. find all books with a chapter that has something to do with XML, or 

4. find all books written by John and Paul with a chapter that has something to do with XML. 

The queries 1,2, and 3 correspond to a path query on D , that is, compute which paths in D that contains 
a specific path as a subsequence. For instance, computing the paths in D that contain the path of three 
nodes labeled “book”, “chapter”, and “XML”, respectively, effectively answers query 3. Most XML-query 
languages, such as XPath [5], support such queries. 

Using a depth-first traversal of I? a path query can be solved in linear time. More precisely, if q is a 
path consisting of n q nodes, answering the path query on D takes 0{n q + tid) time. Hence, if we are given 
path queries qi,... ,qk we can answer them in 0(n qi + ■ ■ ■ + n qk + kno) time. However, we can do better by 
constructing the trie, Q, of q \,..., qu- The trie Q has labels on the nodes and is constructed such there is 
one node for every common prefix of q \,..., q^. Answering all path queries now correspond to solving TPS 
on Q and D. As an example the queries 1,2, and 3 form the trie shown in Fig. 1(a). As Iq < k , Theorem 1 
gives us an algorithm with running time 


O 



n qk + min 


(kn D + tiq, uqId + n D , 


riQn D 
log riD 


+ n D + uq log riQ 


(1) 


Since uq < n qi + ■ ■ ■ + n qk this is at least as good as answering the queries individually and better in many 
cases. If many paths share a prefix, i.e., queries 1 and 2 share ’’book” and ’’author”, the size of uq can 
be much smaller than n qi + • • • + n qk . Using our solution to TPS we can efficiently take advantage of this 
situation since the latter two terms in (1) depend on tiq and not on n qi + • • • + n qk . 

Next consider query 4. This query cannot be answered by solving a TPS problem but is an instance of 
the tree inclusion problem (TI). Here we want to decide if P is included in T, that is, if P can be obtained 
from T by deleting nodes of T. Deleting a node y in T means making the children of y children of the parent 
of y and then removing y. It is straightforward to check that we can answer query 4 by deciding if the tree 
in Fig. 1(a) can be included in the tree in Fig. 1(b). 
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Recently, TI has been recognized as an important XML query primitive and has recieved considerable 
attention, see e.g., [10-15]. Unfortunately, TI is NP-complete in general [9] and therefore the existing 
algorithms are based on heuristics. Observe that a necessary condition for P to included in T is that all 
paths in P are subsequences of paths in T. Hence, we can use TPS to quickly identify trees or parts of trees 
that cannot be included T. We believe that in this way TPS can be used as an effective ’’filter” for many 
tree inclusion problems that occur in practice. 

We note that for ordered trees, that is, a left-to-right ordering among siblings is given, the tree inclusion 
problem can be solved in polynomial time [3,9]. In this case deleting a node y inserts the children of y in 
the place of in the left-to-right order among the siblings of y. 

1.2 Technical Overview 

Given two strings (or labeled paths) a and b , it is straightforward to determine if a is a subsequence of b by 
scanning the character from left to right in b. This uses 0(|a| + |6|) time. We can solve TPS by applying this 
algorithm to each of the pair of paths in P and T, however, this may use as much as 0(npnp(np + np)) time. 
Alternatively, Baeza-Yates [2] showed how to preprocess b in 0{\b\ log |6|) time such that testing whether a 
is a subsequence of b can be done in 0(|a| log |6|) time. Using this data structure on each path in T we can 
solve the TPS problem, however, this may take as much as 0(n^ log nr + n 2 p\ognr)- Hence, none of the 
availiable subsequence algorithms on strings provide an immediate efficient solution to TPS. 

Inspired by the work of Chen [4] we take another approach. We provide a framework for solving TPS. 
The main idea is to traverse T while maintaining a subset of nodes in P, called the state. When reaching a 
leaf z in T the state represents the paths in P that are subsequences of the path from the root to z. At each 
step the state is updated using a simple procedure processing a subset of nodes. The result of Theorem 1 
is obtained by taking the best of two algorithms based on our framework: The first one uses a simple data 
structure to maintain the state. This leads to an algorithm using 0(mm(lpnp + np, nplp + tit)) time. At a 
high level this algorithm resembles the algorithm of Chen [4] and achieves the same running time. However, 
we improve the analysis of the algorithm and show a space bound of 0(np + riT). This should be compared to 
the worst-case quadratic space bound of 0(lpdr + np +nr) given by Chen [4]. Our second algorithm takes a 
different approach combining several techniques. Starting with a simple quadratic time and space algorithm, 
we show how to reduce the space to 0(np log rip) using a decomposition of T into disjoint paths. We then 
divide P into small subtrees of logarithmic size called micro trees. The micro trees are then preprocessed 
such that subsets of nodes in a micro tree can be maintained in constant time and space. Intuitively, this 
leads to a logarithmic improvement of the time and space bounds. 

1.3 Notation and Definitions 

In this section we define the notation and definitions we will use throughout the paper. For a graph G we 
denote the set of nodes and edges by V[G) and E[G), respectively. Let T be a rooted tree. The root of T 
is denoted by root(T). The size of T, denoted by rip, is |V(T)|. The depth of a node y G V(T), depth(y), is 
the number of edges on the path from y to root(T) and the depth of T, denoted dp, is the maximum depth 
of any node in T. The parent of y is denoted parent (y). A node with no children is a leaf and otherwise 
it is an internal node. The number of leaves in T is denoted Ip. Let T(y) denote the subtree of T rooted 
at a node y € V(T). If z € V(T(y)) then y is an ancestor of z and if z G V(T(y))\{y} then y is a proper 
ancestor of 2 . If y is a (proper) ancestor of z then 2 is a (proper) descendant of y. We say that T is labeled 
if each node y is assigned a character, denoted label (y), from an alphabet X. The path from y to root(T), of 
nodes root(T) = t/i,..., yu = y is denoted path(y). Hence, we can formally state TPS as follows: Given two 
rooted tree P and T with leaves X\,... ,x r and y i,..., y s , respectively, determine all pairs (i,j) such that 
path(:Ei) U path(yj). For simplicity we will assume that leaves in P and T are always numbered as above 
and we identify each of the paths by the number of the corresponding leaf. 

Throughout the paper we assume a unit-cost RAM model of computation with word size 0(log?rT) and a 
standard instruction set including bitwise boolean operations, shifts, addition and multiplication. All space 
complexities refer to the number of words used by the algorithm. 
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Figure 2: The letters inside the nodes are the labels, and the identifier of each node is written outside the 
node. Initially we have X = {root(P)}. Since label(root(P)) = a = label(root(T)) we replace root(P) 
with is children and get X mot (x) = {xi,X 2 }- Since label(l) = label(a;i) 7 ^ labelfA^) we get X\ = {x 3 ,X 2 }- 
Continuing this way we get X 2 = {_Li, X2}, X 3 = {_Li, -L 2 }, X 4 = { 2 : 3 , -L 2 }, and X 5 = { 2 : 3 , ± 2 }- The nodes 
3 and 5 are leaves of T and we thus report paths 1 and 2 after computing X 3 and path 2 after computing 
X 5 . 


2 A Framework for solving TPS 

In this section we present a simple general algorithm for the tree path subsequence problem. The key 
ingredient in our algorithm is the following procedure. For any X C V (P) and y G V ( T ) define: 

Down(X, y): Return the set Child({;e G X | label(a;) = label(y)}) U {x € X | label(:r) 7 ^ label(y)}. 

The notation Child(X) denotes the set of children of X. Hence, Down(X, y) is the set consisting of nodes 
in X with a different label than y and the children of the nodes X with the same label as y. We will now 
show how to solve TPS using this procedure. 

First assign a unique number in the range {1,..., Ip} to each leaf in P. Then, for each i, 1 < i < Ip, add 
a pseudo-leaf _Lj as the single child of the *th leaf. All pseudo-leaves are assigned a special label f3 ^ £. The 
algorithm traverses T in a depth first order and computes at each node y the set X y . We call this set the 
state at y. Initially, the state consists of {root(P)}. For 2 G child(y), the state X z can be computed from 
state X y as 

x z = DowN(X.y, z). 

If z is a leaf we report the number of each pseudo-leaf in X, as the paths in P that are subsequences of 
path(z). See Figure 2 for an example. To show the correctness of this approach we need the following lemma. 

Lemma 1. For any node y G V(T) the state X y satisfies the following property: 

x G X y => path(parent(a:)) C path(y) . 

Proof. By induction on the number of iterations of the procedure. Initially, X = {root(P)} satisfies the 
property since root(P) has no parent. Suppose that X y is the current state and z G child(j/) is the next 
node in the depth first traversal of T. By the induction hypothesis X y satisfies the property, that is, for any 
x G X y , path(parent(a;)) C path(f/)). Then, 

X z = DowN(X y , z) = Child({;t G X y | label(x) = label( 2 )}) U{iG X y \ label(a;) 7 ^ label( 2 )} . 

Let 1 be a node in X y . There are two cases. If label(a:) = label(z) then path(a;) C path(^) since 
path(parent(a:)) C path(y). Hence, for any child x' of x we have path(parent(a/)) C path( 2 :). On the 
other hand, if label(a:) 7 ^ label( 2 :) then x G X z . Since y = parent(z) we have path(j/) C path( 2 ), and hence 
path(parent(x)) C path(y) C path(a:). □ 

By the above lemma all paths reported at a leaf z G V(T ) are subsequences of path(^). The following 
lemma shows that the paths reported at a leaf z G V(T) are exactly the paths in P that are subsequences 
of pathfA). 
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Lemma 2. Let z be a leaf in T and let _U be a pseudo-leaf in P. Then, 

1 i £ X z path(parent(_Lj)) U path(^) . 

Proof. It follows immediately from Lemma 1 that _U £ X z => path(parent(_U)) C path(z). It remains 

to show that path(parent(_L,)) C path(z) => X* £ X z . Let path(z) = z±,...,Zk, where z\ = root(T) 

and Zk = z, and let path(parent(_Lj)) = yi,...,ye, where y\ = root(P) and ye — parent(_U). Since 
path (parent (_U)) C path(z) there are nodes Zj t = ye for 1 < i < k, such that (i) ji < ji +1 and (ii) there 

exists no node Zj with label(zj) = label(yi), where ji -1 < j < ji- Initially, X = (root(P)}. We have 

root(P) £ X Zj for all j < j\, since z^ is the first node on path(z) with label label(root(P)). When we get to 
Zj L , root(P) is removed from the state and y 2 is inserted. Similarly, y t is in all states X Zj for ji_i < j < ji. 
It follows that JLj is in all states X z , where j > je and thus _Lj £ X Zk = X z . □ 

The next lemma can be used to give an upper bound on the number of nodes in a state. 

Lemma 3. For any y £ V(T) the state X y has the following property: Let x £ X y . Then no ancestor of x 
is in X y . 

Proof. By induction on the length of path(y). Initially, the state only contains root(P). Let z be the 
parent of y, and thus X y is computed from X z . First we note that for all nodes x £ X y either x £ X z 

or parent (a;) £ X z . \i x £ X z it follows from the induction hypothesis that no ancestor of x is in X z , and 

thus no ancestors of x can be in X y . If parent(a:) £ X, then due to the definition of Down we must have 
label(a;) = label(j/). It follows from the definition of Down that parent(:r) ^ X y . □ 

It follows from Lemma 3 that \X y \ < Ip for any y £ V(T). If we store the state in an unordered linked 
list each step of the depth-first traversal takes time 0(lp) giving a total Olfpnp + np) time algorithm. Since 
each state is of size at most Ip the space used is 0(np + Ipnp). In the following sections we show how to 
improve these bounds. 


3 A Simple Algorithm 

In this section we consider a simple implementation of the above algorithm, which has running time 
O (mm(lpnT + np,nplp + it-t)) and uses 0(np + np) space. We assume that the size of the alphabet 
is rip + np and each character in S is represented by an integer in the range {1,. .., np + np}. If this is not 
the case we can sort all characters in V (P) U V(T) and replace each label by its rank in the sorted order. 
This does not change the solution to the problem, and assuming at least a logarithmic number of leaves in 
both trees it does not affect the running time. To get the space usage down to linear we will avoid saving 
all states. For this purpose we introduce the procedure Up, which reconstructs the state X, from the state 
X y , where z = parent(y). We can thus save space as we only need to save the current state. 

We use the following data structure to represent the current state X y : A node dictionary consists of two 
dictionaries denoted X c and X p . The dictionary X c represents the node set corresponding to X y , and the 
dictionary X p represents the node set corresponding to the set {x £ X z \ x ^ X y and 2 is an ancestor of y}. 
That is, X c represents the nodes in the current state, and X p represents the nodes that is in a state X z , 
where z is an ancestor of y in T, but not in X y . We will use X p to reconstruct previous states. The 
dictionary X c is indexed by S and X p is indexed by V(T). The subsets stored at each entry are represented 
by doubly-linked lists. Furthermore, each node in X c maintains a pointer to its parent in X p and each node 
x' in X p stores a linked list of pointers to its children in X p . With this representation the total size of the 
node dictionary is 0(np + np). 

Next we show how to solve the tree path subsequence problem in our framework using the node dictionary 
representation. For simplicity, we add a node T to P as a the parent of root(P). Initially, the X p represents 
T and X c represents root(P). The Down and Up procedures are implemented as follows: 

Down((X p , X c ), y): 1. Set X := X c [label(y)] and X c [label(j/)] := 0. 
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2. For each x G X do: 

(a) Set XP[y\ := X?[y] U {x}. 

(b) For each x' G child(x) do: 

i. Set X c [label(x')] := X c [label(x')] U {x}. 

ii. Create pointers between x' and x. 

3. Return ( XP,X C ). 

Up((Xp, X c ), y): 1. Set X := XP[y\ and XP[y] := 0. 

2. For each x G X do: 

(a) Set X c [label(x)] := X c [label(x)] U {x}. 

(b) For each x' G child(x) do: 

i. Remove pointers between x' and x. 

ii. Set X c [label(x')] := X c [label(x')] \ {x'|. 

3. Return ( X P ,X C ). 

The next lennna shows that Up correctly reconstructs the former state. 

Lemma 4. Let X z = ( X C ,X P ) be a state computed at a node z G V(T), and let y be a child of z. Then, 

X z = Up(Down(X z , y), y) . 

Proof. Let (Xf,Xf) = DoWN(X z ,y) and (Xf,Xf) = Up((Xf,Xf), y). We will first show that x G X z => 
x G Up(Down (X z ,y),y). 

Let x be a node in X c . There are two cases. If x G X c [label(y)], then it follows from the implementation 
of Down that x G Xf[y]. By the implementation of Up, x G Xf [y] implies x G Xf. If x ^ X c [label(y)] 
then x G Xf. We need to show parent(x) ^ Xf[y], This will imply x G Xf, since the only nodes removed 
from Xf when computing Xf are the nodes with a parent in X^[y], Since y is unique it follows from the 
implementation of Down that parent(x) G Xf implies x G X c [label(y)]. 

Let x be a node in X p . Since y is unique we have x G X p [y'\ for some y' y. It follows immediately 
from the implementation of Up and Down that X p [y'] = Xf [y'\ = X<f[y'\, when y' ^ y, and thus X p = Xlf. 

We will now show x G Up(Down(X z , y), y) =>■ x G X z . Let x be a node in Xf. There are two cases. If 
x fL X1 then it follows from the implementation of Up that x G Xf [y]. By the implementation of Down, 
x G Xf[y] implies x G X c [label(y)], i.e., x G X°. If x G Xf then by the implementation of Up, x G Xf 
implies parent(x) ^ xf [y] . It follows from the implementation of Down that x G X°. Finally, let x be a 
node in Xf. As argued above X p = Xf, and thus x G X p . □ 

From the current state X y = (X C ,X P ) the next state X z is computed as follows: 


X z 


DoWN(X y , z) if y = parent(z), 
Up(X y , y) if z = parent(y). 


The correctness of the algorithm follows from Lemma 2 and Lennna 4. We will now analyze the running 
time of the algorithm. The procedures Down and Up uses time linear in the size of the current state and 
the state computed. By Lennna 3 the size of each state is 0(lp). Each step in the depth-first traversal thus 
takes time 0(/p), which gives a total running time of 0(lpnx + np). On the other hand consider a path t in 
T. We will argue that the computation of all the states along the path takes total time 0(np + nf), where 
ut is the number of nodes in t. To show this we need the following lennna. 


Lemma 5. Let t be a path in T. During the computation of the states along the path t, any node x G V(P) 
is inserted into X c at most once. 
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Proof. Since t is a path we only need to consider the Down computations. The only way a node x & V (P) 
can be inserted into X c is if parent(:r) € X c . It thus follows from Lemma 3 that x can be inserted into X c 
at most once. □ 

It follows from Lemma 5 that the computations of the all states when T is a path takes time 0(np + np). 
Consider a path-decomposition of T. A path-decomposition of T is a decomposition of T into disjoint paths. 

We can make such a path-decomposition of the tree T consisting of It paths. Since the running time of 
Up and Down both are linear in the size of the current and computed state it follows from Lemma 4 that 
we only need to consider the total cost of the Down computations on the paths in the path-decompostion. 
Thus, the algorithm uses time at most Y^ter 0(n p + n t ) — 0(nplr + np). 

Next we consider the space used by the algorithm. Lemma 3 implies that \X C \ < Ip. Now consider the 
size of X p . A node is inserted into X p when it is removed from X c . It is removed again when inserted into 
X c again. Thus Lemma 5 implies \X P \ < np at any time. The total space usage is thus 0(np + np). To 
summarize we have shown, 

Theorem 2. For trees P and T the tree path subsequence problem can be solved in O (min (Ipnp + np , nplp + np)) 
time and 0(np + np) space. 

4 A Worst-Case Efficient Algorithm 

In this section we consider the worst-case complexity of TPS and present an algorithm using subquadratic 
running time and linear space. The new algorithm works within our framework but does not use the Up 
procedure or the node dictionaries from the previous section. 

Recall that using a simple linked list to represent the states we immediately get an algorithm using 
0{npnp) time and space. We first show how to modify the traversal of T and discard states along the 
way such that at most 0(log rip) states are stored at any step in the traversal. This improves the space to 
0(np log np). Secondly, we decompose P into small subtrees, called micro trees , of size 0(\ognp). Each 
micro tree can be represented in a single word of memory and therefore a state uses only 0 ( lc) g^ T ) space. 

In total the space used to represent the Oflognp) states is 0{ lo ^ T ■ log np) = 0(np + log nr)- Finally, 
we show how to preprocess P in linear time and space such that computing the new state can be done in 
constant time per micro tree. Intuitively, this achieves the 0(\ognp) speedup. 

4.1 Heavy Path Traversal 

In this section we present the modified traversal of T. We first partition T into disjoint paths as follows. For 
each node y € V(T) let size(y) = \V(T{y))\. We classify each node as either heavy or light as follows. The 
root is light. For each internal node y we pick a child z of y of maximum size among the children of y and 
classify 3 as heavy. The remaining children are light. An edge to a light child is a light edge, and an edge 
to a heavy child is a heavy edge. The heavy child of a node y is denoted heavy (y). Let lightdepth(f/) denote 
the number of light edges on the path from y to root(T). 

Lemma 6 (Harel and Tarjan [8]). For any tree T and node y G V{T). lightdepth(y) < log np + 0(1). 

Removing the light edges, T is partitioned into heavy paths. We traverse T according to the heavy paths 
using the following procedure. For node y £ V(T) define: 

Visit(j/): 1. If y is a leaf report all leaves in X y and return. 

2. Else let y \,..., yk be the light children of y and let z = heavy (y). 

3. For i := 1 to k do: 

(a) Compute X yi := DoWN(X y , yf) 
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(b) Compute Visit (y,). 

4. Compute X z := Down(X 9 ,x). 

5. Discard X y and compute Visit(z). 

The procedure is called on the root node of T with the initial state {root(P)}. The traversal resembles a 
depth first traversal, however, at each step the light children are visited before the heavy child. We therefore 
call this a heavy path traversal. Furthermore, after the heavy child (and therefore all children) has been 
visited we discard X y . At any step we have that before calling Visit (y) the state X y is availiable, and 
therefore the procedure is correct. We have the following property: 

Lemma 7. For any tree T the heavy path traversal stores at most log nr + 0(1) states. 

Proof. At any node y £ V ( T ) we store at most one state for each of the light nodes on the path from y to 
root(T). Hence, by Lemma 6 the result follows. □ 

Using the heavy-path traversal immediately gives an 0{npnr) time and 0(np log nr) space algorithm. 
In the following section we improve the time and space by an additional Oflognp) factor. 

4.2 Micro Tree Decomposition 

In this section we present the decomposition of P into small subtrees. A micro tree is a connected subgraph of 
P. A set of micro trees MS is a micro tree decomposition iff V (P) = Umgms V(M) and for any M, M' £ MS, 
(V(M)\{root(M)}) n (V(M')\{root(M')}) = 0. Hence, two micro trees in a decomposition share at most 
one node and this node must be the root in at least one of the micro trees. If root(M') £ V(M) then M is 
the parent of M' and M' is the child of M. A micro tree with no children is a leaf and a micro tree with no 
parent is a root. Note that we may have several root micro trees since they can overlap at the node root(P). 
We decompose P according to the following classic result: 

Lemma 8 (Gabow and Tarjan [ 6 ]). For any tree P and parameter s > 1, it is possible to build a micro tree 
decomposition MS of P in linear time such that \MS\ = 0(\np/s ]) and \V(M)\ < s for any M £ MS 

4.3 Implementing the Algorithm 

In this section we show how to implement the Down procedure using the micro tree decomposition. First 
decompose P according to Lemma 8 for a parameter s to be chosen later. Hence, each micro tree has at 
most s nodes and \MS\ = 0{\rip/s]). We represent the state X compactly using a bit vector for each micro 
tree. Specifically, for any micro tree M we store a bit vector Xm = [ 6 i,..., 6 S ], such that X m [i] = 1 iff the 
ith node in a preorder traversal of M is in X. If |V(M)| < s we leave the remaining values undefined. Later 
we choose s = ©(logiTr) such that each bit vector can be represented in a single word. 

Next we define a DowNm procedure on each micro tree M £ MS. Due to the overlap between micro 
trees the DowNm procedure takes a bit b which will be used to propagate information between micro trees. 
For each micro tree M £ MS, bit vector Xm, bit b, and y £ V(T) define: 

DowNm( Xm, b, y): Compute the state X' M := Child({x G Xm | label(a:) = label(y)}) U {x £ Xm \ 
label(a;) ^ label(y)}. If b = 0, return X' M , else return X' M U {root(M)}. 

Later we will show how to implemenent DowNm in constant time for s = ©(logiTr). First we show how to 
use DowNm to simulate Down on P. We define a recursive procedure Down which traverse the hiearchy 
of micro trees. For micro tree M, state X, bit b, and y £ V(T) define: 

Down(V, M, b, y): Let Mi,..., Mf. be the children of M. 

1. Compute Xm := DowNm(Tm,6 , y). 

2. For i := 1 to k do: 



(a) Compute Down (X,Mi,bi,y), where 6* = 1 iff 
root(Mj) € Ijf. 

Intuitively, the Down procedure works in a top-down fashion using the 6 bit to propagate the new state of the 
root of micro tree. To solve the problem within our framework we initially construct the state representing 
{root(P)}. Then, at each step we call Down(I?j , 0, y) on each root micro tree Rj. We formally show that 
this is correct: 

Lemma 9. The above algorithm correctly simulates the Down procedure on P. 

Proof. Let X be the state and let X' := Down(X, y). For simplicity, assume that there is only one root 
micro tree R. Since the root micro trees can only overlap at root(P) it is straightforward to generalize the 
result to any number of roots. We show that if X is represented by bit vectors at each micro tree then calling 
Down(P, 0,y) correctly produces the new state X'. 

If R is the only micro tree then only line 1 is executed. Since 6 = 0 this produces the correct state by 
definition of Down^. Otherwise, consider a micro tree M with children Mi ,..., M k and assume that 6=1 
iff root(M) £ X'. Line 1 computes and stores the new state returned by Down^. If 6 = 0 the correctness 
follows immediately. If 6 = 1 observe that Down^ first computes the new state and then adds root(M). 
Hence, in both cases the state of M is correctly computed. Line 2 recursively computes the new state of the 
children of M. □ 

If each micro tree has size at most s and DowNm can be computed in constant time it follows that 
the above algorithm solves TPS in 0(|~np/s~|) time. In the following section we show how to do this for 
s = ©(lognp), while maintaining linear space. 

4.4 Representing Micro Trees 

In this section we show how to preprocess all micro trees M £ MS such that DowNm can be computed 
in constant time. This preprocessing may be viewed as a “Four Russian Technique” [1]. To achieve this in 
linear space we need the following auxiliary procedures on micro trees. For each micro tree M, bit vector 
Xm, and a £ S define: 

ChilDm(Wm): Return the bit vector of nodes in M that are children of nodes in Xm- 
Eq m (c»): Return the bit vector of nodes in M labeled a. 

By definition it follows that: 


Down m (Wm, 6, y) 


{ Child m (X m n EQ M (label(y))) U 

(Xm\(X m n EQ M (labe%))) if 6 = 0, 

Child m (X m n EQ M (label(y))) U 
(X M \(X M fl EQ M (labe%))) U (root(M)} if 6 = 1. 


Recall that the bit vectors are represented in a single word. Hence, given ChilDm and Eq m we can compute 
DowNm using standard bit-operations in constant time. 

Next we show how to efficiently implement the operations. For each micro tree M £ MS we store the 
value EQ M (a) in a hash table indexed by a. Since the total number of different characters in any M £ MS 
is at most s, the hash table Eq m contains at most s entries. Hence, the total number of entries in all 
hash tables is O(np). Using perfect hashing we can thus represent Eq m for all micro trees, M £ AIS, in 
0(np) space and 0(1) worst-case lookup time. The preprocessing time is expected 0(np) w.h.p.. To get a 
worst-case bound we use the deterministic dictionary of Hagerup et. al. [7] with 0((np) log(np)) worst-case 
preprocessing time. 

Next consider implementing ChilDm- Since this procedure is independent of the labeling of M it suffices 
to precompute it for all topologically different rooted trees of size at most s. The total number of such trees 
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is less than 2 2s and the number of different states in each tree is at most 2 s . Therefore Childm has to be 
computed for a total of 2 2s • 2 s = 2 3s different inputs. For any given tree and any given state, the value of 
ChilDm can be computed and encoded in O(s) time. In total we can precompute all values of Childm in 
0(s2 3s ) time. Choosing the largest s such that 3s + logs < r it (hence s = 0 (log nr)) we can precompute 
all values of Childm in 0(s2 3s ) = 0{np) time and space. Each of the inputs to Childm are encoded in a 
single word such that we can look them up in constant time. 

Finally, note that we also need to report the leaves of a state efficiently since this is needed in line 1 in 
the VlSiT-procedure. To do this compute the state L corresponding to all leaves in P. Clearly, the leaves of 
a state X can be computed by performing a bitwise AND of each pair of bit vectors in L and X. Computing 
L uses 0{np) time and the bitwise AND operation uses 0(\np/s\) time. 

Combining the results, we decompose P, for s as described above, and compute all values of Eq m and 
Child m- Then, we solve TPS using the heavy-path traversal. Since s = Qifognp), from Lemmas 7 and 8 
we have the following theorem: 

Theorem 3. For trees P andT the tree path subsequence problem can be solved in 0( +np + np log np) 

time and 0(np + np) space. 

Combining the results of Theorems 2 and 3 proves Theorem 1. 
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